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Amendments to the Claim 

The following listing of claims will replace all prior versions and listings of claims in the 
application: 

Listing of Claims 

1 . (Canceled) 

2. (Currently Amended) A method for constructing a repeat database within a single species 
comprising: 

selecting a query sequence; 
selecting known repeat sequences; 

adding known repeat sequences into a repeat sequence database; 
masking said query sequence with repeat sequences in the repeat sequence database to 
create a contig assembly ; 

testing said masked query sequence with a redundant file; 

identifying sequences in the redundant file that contain a similar sequence to a portion of 
the query sequence, wherein said identified sequences and said similar portion of the query 
sequence make up a pairwise sequence alignment; 

aligning all the identified pairwise sequence alignments; 

designafing the right and left endpoints of each identified sequence and any intervening 
sequences; 

identifying a position within the query sequence corresponding to each endpoint; 

defining regions within the query sequence, wherein a region is a sequence between two 
consecutive positions matching two endpoints; 

identifying any two successive regions having a large variance in the number of sequence 
matches; and 

adding the sequence within the region of the two successive regions having the highest 
number of sequence matches into the repeat sequence database. 
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3. (Original) The method of claim 2, wherein the large variance in the number of sequence 
matches is equal to 5 or more. 

4. (Canceled) 

5. (Previously Presented) The method of claim 2, wherein said query sequence is a 
deoxyribonucleotide sequence. 

6. (Previously Presented) The method of claim 2, wherein said query sequence is a 
ribonucleotide sequence. 

7. (Previously Presented) The method of claim 2, wherein said query sequences are derived 
from animal DNA or RNA. 

8. (Original) The method of claim 7, wherein said animal is a human. * 

9. (Original) The method of claim 8, wherein said animal is a mouse. 

10. -15. (Canceled) 

16. (Previously Presented) The method of claim 2, wherein said repeat sequences are 
postulated based upon amino acid sequences. 

17. (Canceled) 

18. (Original) The method of claim 2, wherein said database is encoded in a written medium. 

19. (Original) The method of claim 2, wherein said database is encoded in an electronic 
medium. 

20. (Original) The method of claim 19, wherein said electronic medium is a computer- 
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readable medium. 

2 1 . (Original) The method of claim 20, wherein said computer-readable medium is 
addressable through an internet connection. 

22. (Original) The method of claim 2, wherein said redundant file is a Public Domain 
Database. 

23. (Original) The method of claim 22, wherein said Public Domain Database is GenBank. 

24. (Original) The method of claim 22, wherein said Public Domain Database is dbEST. 

25. (Original) The method of claim 22, wherein said Public Domain Database is TIGR. 

26. (Original) The method of claim 22^ wherein said Public Domain Database is SwissProt. 

27. (Original) The method of claim 2, wherein sequence comparisons are carried out using a 
Database Search Algorithm. 

28. (Original) The method of claim 27, wherein said Database Search Algorithm is BLAST. 

29. (Original) The method of claim 27, wherein said Database Search Algorithm is FASTA. 

30. (Original) The method of claim 27, wherein said Database Search Algorithm is Smith- 
Waterman. 

3 1 . (Original) The method of claim 2, wherein said sequence comparisons are carried out 
utilizing a Scoring Matrix Program. 

32. (Original) The method of claim 3 1 , wherein said Scoring Matrix Program is PAM. 
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33. (Original) The method of claim 3 1 , wherein said Scoring Matrix Program is BLOSUM. 
34- 38. (Canceled) 

39. (Withdrawn) An improved database of nucleotide sequences, the improvement consisting 
of repeat sequences containing a similar sequence to a portion of a query sequence, wherein said 
identified sequences and said similar portion of the query sequence make up a pairwise sequence 
alignment, and wherein all identified pairwise sequence alignments have right and left endpoints 
of each identified sequence and any intervening sequences. 
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